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ABSTRACT 

Primordial non-Gaussianity of local type is predicted to lead to enhanced halo clustering on 
very large scales. Photometric quasars, which can be seen from cosmological rcdshifts z > 2 even 
in wide-shallow optical surveys, are promising tracers for constraining non-Gaussianity using 
this effect. However, large-scale systematics can also mimic this signature of non-Gaussianity. 
In order to assess the contribution of systematic effects, we cross-correlate overdensity maps of 
photometric quasars from the Sloan Digital Sky Survey (SDSS) Data Release 6 (DR6) in different 
redshift ranges. We find that the maps are significantly correlated on large scales, even though we 
expect the angular distributions of quasars at different redshifts to be uncorrelated. This implies 
that the quasar maps are contaminated with systematic errors. We investigate the use of external 
templates that provide information on the spatial dependence of potential systematic errors to 
reduce the level of spurious clustering in the quasar data. We find that templates associated with 
stellar density, the stellar color locus, airmass, and seeing are major contaminants of the quasar 
maps, with seeing having the largest effect. Using template projection, we are able to decrease 
the significance of the cross-correlation measurement on the largest scales from 9.2(T to 5.4(T. 
Although this is an improvement, the remaining cross-correlation suggests the contamination in 
this quasar sample is too great to allow a competitive constraint on /nl by correlations internal 
to this sample. The SDSS quasar catalog exhibits spurious number density fluctuations of ~2% 
rms, and we need a contamination level less than 1% (0.6%) in order to measure values of /nl 
less than 100 (10). Properly dealing with these systematics will be paramount for future large 
scale structure surveys that seek to constrain non-Gaussianity. 

Subject headings: Quasars and Active Galactic Nuclei, Astrophysical Data, Data Analysis and Techniques 



Introduction 



Inflation is the standard paradigm for the gen- 
eration of perturbations in matter density that 
produ ces large-scale structure (LSS) in the Uni- 
verse (IStarobinskiilll979l:lGuthl ll98lt lLindelll982 : 
Albrecht fc Steinhardt]ll982h . Although the infla- 



tionary paradigm has successfully predicted vari- 
ous properties of the observable universe, includ- 



of perturbations (IMukhanov fc Chibisov 1981 


lHawkinell982| 


Guth fc Pilll982: StarobinskiJl982 


iBardecn et al\ 


1983|). the correct model of infla- 



tion has yet to be conflrnicd. The simplest in- 



1 



flation models predict nearly Gaussian primor- 



— ^, ^ — ^ v^,-,— ^^^^^^ 

as multi-field inflation (Li 


Qde & Muk 


lanovl 


Bernardeau & Uzan 20021: 


Lvth et all 


2003) 



posit a departure from a Gaussian distribu- 
tion. Alternatives to inflation, such as the ekpy- 
rotic model of cyclic expansion and contraction 
( Khourv et all I2OOII: Isteinhardt fc Turokl l2002t 
Creminelh fc Senatord l2007t ). also predict non- 
Gaussian primordial perturbations. Since a de- 
tection of non-Gaussianity would discriminate be- 
tween these fundamentally different models, much 
work is being done to constrain non-Gaussianity, 
both in LSS through the galaxy distribution and 
through anisotropics in the cosmic microwave 
background (CMB). Primordial non-Gaussianity 
is readily probed through measurements of the 
bispectrum of the CMB, in which a nonzero mea- 
surement constitutes a "smoking gun" detection, 
modulo any systematic effects. Some alterna- 
tive probes of non-Gaussianity in LSS include 
the galaxy bispectrum, which is plagued by non- 
linearities, and galaxy cluster abundances and 
dark-matter halo clustering, which suffer from 
low-number statistics. The accepted parametriza- 
tion of primordial non-Gaussianity is to introduce 
a quadratic term to the primordial potential 
written as 



$ = + /nl(0'-(0'», 



(1) 



wher e (j} is a Gaussian rand om field (jKomatsu fc Spergel 
200 ll : iGangui et aZlll994[ ). This form describes 
local-type non-Gaussianity with an amplitude 
/nl- The latest constraint on /nl is from 
the Wilkin son Microwave Anisotropy Probe's 
(WMAP) ||WMAP Collaboration! |2005|) Seven- 
Year bispectrun i, which gives —10 < fm. < +74 



at 95% G.L. (iKomatsu aLl l201l[ ). Planck 
( Planck Collaboration! 2006 ). which will soon give 
results from its first data release, is expected to 
produce constraints on /nl of order (t(/nl) ~ 7 



(|Coorav et aT!!2008!) 

One useful effect of non-Gaussianity that has 
gained much attention, and which we explore in 
this analysis, is a distinct sca le-dependen t clus 



2007 



tering bias on l arge scales (jPalal et al. 

Slosar et al\\200^ . Specifically, it has been shown 
that /NL-type non-Gaussianity produces a shift 
in the bias that behaves as Ab{k) oc /nl/^^; 
this can suppress or enhance clustering on the 



largest scales. Various authors have used the 
large-scale angular power spectra of LSS trac- 
ers t o constrain fm, ass uming a scale-dependent 
bias. !Slosar elTali (!2008), using the Sl oan Digital 



Sky Survey CS DSS) (lYork et oLl I2OOOI) Da ta Re- 
lease 5 (DR5) (jAdelman-McGarthv et U] !2007! ). 
derived the constraints —82 < /nl < +70 at 
95% C.L. using the photometric quasar sample 
and -29 < /nl < +70 at 95% C.L. when these 
quasars are combined with other data sets, such 
as the integrated Sachs- Wolfe effect (ISW) and 
luminous re d galaxies (LRGs). This analysis was 
exten ded in!Xia et ali (!201l!) to includ e the SDSS 
DR6 (|Adelman-McCarthv erall!2008!) photomet 



ric qu asar catalog compiled bv iRichards et al. 



( 2009 ). hereafter RQCat. !Tseliakhovich et al. 



20101 ) extended this analysis to a two-p arameter 



curvaton model (Linde fc Mukhanov!!l997: MoUerach 



199C 


■ 


2005 





19901 : !Lvth fc Wandsl2002! : !Lvth eV aL!!2003! : !Boubekeur fc Lvth 



use t his method in combination with other dat a 
sets (!Xia et all !2010l: !peBernardis et all !2010!) . 



DeBernardis all (120101) a l so sho wed that Planck 



and Euclid ( Amiaux et all !2012! ) together could 
possibly detect /nl 5. Finally, work on using 



multiple tracers of different bias (|Hamaus et al. 



2011! ) has predicted future limits of /nl ~ 0.1 



As in all auto-spectrum measurements, partic- 
ularly on large angular scales, potential correla- 
tions caused by systematic effects must be re- 
moved to isolate the correlation signal from the 
LSS tracer. Systematics have been a concern in 
recent analyses of the B aryon Oscillation Spec - 



troscopic Survey (BOSS) (lEisenstein et aOl201ll). 
Stud i es of BOSS pho tometric galaxies (|Ross et al. 
2011! : !Ho et aT!!2012!) attempted to remove system- 
atic effects by fitting an amplitude for each sys- 
tematic template, and subtracting its estimated 
contribu tion from the power spectrum. Recent 
work in !Huterer et all (!2012h outlines a formal- 
ism for how photometric calibration variations af- 
fect power spectrum and cosmological parame- 
ter estimations. Another method for removing 
systematics is mode projection, which explicitly 
marginalizes over the amplitude of each template 
that traces the spatial dependence of a potential 
systematic error. This method has been profitably 
used for foreground re moval in the low- £ CMB 
power spectrum (e.g. Slosar et al. 2004 ). This 
method is more robust than subtracting the esti- 
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mated contamination from the power spectrum es- 
timator, because it is exactly insensitive to any sys- 
tematic that traces the spatial dependence of the 
template. It also avoids the possibility of "over- 
subtraction" - the phenomenon in which chance 
correlations of the real galaxy density with sys- 
tematics templates lead to an overestimate of the 
contamination, and hence lead to a downward bias 
in the \ow-£ power spectrum. 

In this paper, we attempt to remove large-scale 
systematics from the latest photometric quasar 
sample available, which is RQCat. We investigate 
the photometric quasar sample because, by prob- 
ing large rcdshifts, quasars are able to probe scale- 
dependent bias more effective ly than other matte r 

(I2OO8I) . 



tracers, as demonstrated in Slosar et al 



We use photometric quasars from a larger redshift 
range than this previous analysis, using redshifts 
as low as z = 0.9 and as high as z = 2.9. We 
divide this sample into three redshift slices (0.9- 
1.3, 1.6-2.0, 2.3-2.9) and construct three angular 
cross-power spectra to test if they are consistent 
with zero, which we expect since quasars at dif- 
ferent redshifts should be uncorrelated on large 
scales. We also construct templates of various po- 
tential systematic effects and try mode-projecting 
them from the angular cross-power spectrum esti- 
mator. 



Note that the sample used by e.g. ISlosar et al. 



(j2008[ ) was constructed by a different algorithm 
and is not the restriction of the RQCat sample to 
an earlier, smaller footprint. Therefore the specific 
amplitude of systematic errors herein should not 
be taken as quantitatively indicative of the level 
of contamination in these previous analyses. 

We find that without mode-projecting any sys- 
tematics, the first and second redshift slices are 
correlated with an 9.2cr significance, and the first 
and third redshift slices are correlated with a 2.7cr 
significance, both on large scales. This implies 
that systematics are heavily contaminating large- 
scale correlations in the quasar maps. Using mode 
projection, we find significant reductions in the 
correlations between redshift slices after remov- 
ing templates corresponding to the stellar den- 
sity, stellar locus offsets, airmass, and seeing. We 
are able to decrease the significance of the corre- 
lation between the first and third redshift slices 
to 1.8cr; however, the significance of the correla- 
tion between the first and second redshift slices 



decreased only to 5.4cr, which is indicative of sig- 
nificant contamination. While the quasar cata- 
log exhibits ^2% contamination, measuring /nl 
will require a contamination level less than 1%. 
We conclude that other systematic effects are still 
present that are contaminating the SDSS photo- 
metric quasar sample. We must improve methods 
of construct cleaner photometric quasar samples if 
we hope to use them to provide competitive con- 
straints on /nl on large scales. 

The plan of our paper is as follows: in Sec. [2] we 
describe the photometric quasar sample we use in 
the analysis. In Sec.[3]we describe briefly the the- 
ory behind scale-dependent bias and how it affects 
the angular power spectrum. In Sec. 21 we outline 
the estimators we use, the method of mode projec- 
tion, and the systematics for which we search. In 
Sec. [5l we give the angular cross-power spectrum 
measurements, before and after mode projection, 
and we discuss its implications. We conclude in 
Sec. |6l Wherever not explicitly mentioned, we as- 
sume for the background cosmology and power 
spectrum a flat ACDM cosmology with param- 
eters compatible w ith the WMAP7 data release 
( Larson et a/.ll2011 ). 



2. Choice of Sample 

W e use the photometric quasars fro m the SDSS 
DR6 (jAdelman-McCarthv et a/.ll2008l ) to trace the 
matter density and construct its angular spec- 
trum. The SPSS consists of a 2.5-m telescope 
( Gunn et al\ 2006fl w i th a 5 - filter (ugriz) imaging 
camera (jGunn et al\ 119981 ) and a spectrograph. 
Automated pipel ines are responsi ble for the astro- 
metric solution (I Pier et aZ.ll2003h and photomet 



2001 



ric c al ibration (IFukugita et al\ 119961: iHogg et al. 



Tucker et 



2006t 



Padmanabhan et al. 



20081 ). Bright galaxies, luminous red galaxies 
(LRGs), and quasars are selected for follow-up 
spect r oscopy ( Strauss a/.ll20o3 Eisenstein et al. 



I2OOII: [Richards et all l2002tlBianton et all l200l[) 



The data used here were acquired between Au- 
gust 1998 and J une 2006 and are included in SPS S 
Pata Release 6 (jAdelman-McCarthv et aLll2008l ). 

Specifically, we use the p hotometric qua s ar cat - 
alog (RQCat) composed by Richards et all ( 2009f ). 
The entire catalog consists of 1,172,157 objects 
from 8417 deg^ on the sky selected as quasars 
from the SPSS PR6 photometric imaging data. 
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Quasars are the brightest objects at large redshifts 
{z > 1), making them better tracers of the matter 
density at large scales than LRGs. We limit our 
dataset in this analysis to UV-excess {u — g < 1.0) 
quasars. Specifically, we implement this choice 
by requiring the catalog columns good > and 
uvxts = 1. Wc divide quasars into 3 redshift 



shces (zp^ 



mm; '^p,max 



) = (0.9,1.3), (1.6,2.0), and 



(2.3,2.9). We plot the three redshift distributions 
in Fig. [TJ while their properties are given in Ta- 
ble [H The procedure for constructing the redshift 
distributions is described in the appendix and i s 



2008h . 



similar to that described in Ref. (jHo et al 
For the survey geometry we use the DR6 survey 
mask as a union of the survey runs retrieved from 
the SDSS CAS server. We omitted runs 2189 and 
2190 because many objects in these runs were cut 
from RQCat. This mask was pixelized using the 
MANGLE software (|Hamilton fc Tegmarkl 12003 : 
Swanson et a/.l 120081 ). We pixelize the quasars as 
a number overdensity, Sg = (n — n)/n, onto a 
HEALPix pixehzation (jCorski et a/.ll2005f) of the 
sphere with Nres = 256, where n is the pixel's 
number of quasars divided by the pixel's survey 
coverage w and n (J2i "^i)- 

We try to clean our sample by removing pix- 
els that we suspect are contaminated. We re- 
ject pixels with extinction E{B — V) > 0.05, 
full widths at half-maximum of its point-spread 
function (PSF) FWHM > 2 arcsec, and stellar 
densities (smoothed with a 2° FWHM Gaussian) 
n stars > 562 stars/deg^ (twice th e avera ge stel- 



lar density), similar to iHo et al\ (|2008l ). The 



extinction cut is very important because a high 
extinction affects the u band, which is crucial to 
identifying quasars. Also, since stars tend to be 
misidentified as quasars, it seems prudent to cut 
regions with high stellar density. Wc impl ement 



these cuts using dust maps from Schlcgc l et al. 



(|1998[ ) and a stellar overdensity map, smoothed 
with a 2° FWHM Gaussian, constructed from 
stars (18.0 < r < 18.5) with i < 21.3 (cutoff 
for point sources in RQGat) from the SDSS DR6 
(jAdelman-McGarthv et al\\200^ . We also reject 
pixels for which the survey region covers less than 
80% of the pixel area. We also excised several rect- 
angular regions that appeared to contain missing 
data; the angular rectangles in equatorial (J2000) 
coordinates that were removed are (a, 6) ~ 
(122°-139°, -1.5°-(-0.5)°), (121°-126°, 0°-4°), 



(119°-128°, 4°-6°), (111°-119°, 6°-25°), 
(111.5°-117.5°, 25°-30°), (110°-116°, 32°-35°), 
(246°-251°, 8.5°-13.5°), (255°-270°, 20°-40°), 
(268°-271°,46°-49°), and (232°-240°, 26°-30°). 
Finally, we cut regions comprising HEALPix 
A^sido = 64 pixels that contained no stars for which 
to estimate the stellar color locus and flux-error 
locus offset systematic templates, which use pho- 
tometric measurements of stars, as well as cut 
pixels with color locus offsets greater than 5 mag 
(see Sec. 14. 3p . After these cuts, the survey region 
comprises 122,780 pixels covering a solid angle of 
6440 deg^. The quasar maps for each slice are 
shown in Fig. [2l Note that we reduce a large- 
scale systematic by implementing an additional 
cut on the quasars in the highest redshift bin 
to exhibit a high kernel-density-estimator (KDE) 
quasar probability density, requiring the catalog 
column qsodens > (sec Fig. [2]). 

3. Theory 

3.1. Scale-dependent bias due to a non- 
Gaussian primordial inhomogeneity 

The derivation for the scale-dependent bias 
model for non-Ga ussianity of t he "l ocal" (/nl) 
type is detailed in ISlosar et al\ (j20081 ). for which 
we give a brief overview. The model assumes the 
halo model scenario (see ICoorav fc ShethI (l2002h 
for a detailed review) in which all matter is con- 
tained in a distribution of halos on large scales 
which mimics the distribution of matter on small 
scales. In the Gaussian case, fluctuations on small 
and large scales are uncorrelated. However, a 
signature of local-type non-Gaussianity is that 



Table 1: Properties of the 3 QSO photometric red- 
shift slices; Zp is the photometric redshift range, 
and Zmoan IS thc mean (true) redshift of the slice, 
and Nqso is the number of QSOs in the redshift 
shce. 



Label 


Zp 




Nqso 


zOl 


0.9-1.3 


1.230 


75,835 


z02 


1.6-2.0 


1.731 


91,356 


z03 


2.3-2.9 


2.210 


10,806 



4 



z= 1.231 



z = 1.742 



z = 2.227 




Fig. 2. — Top: The QSO density in the 3 photometric redshift shccs. The 180° radius caps arc displayed 
in a Lambert Azimuthal Equal- Area Projection, with the North Galactic Pole at the map center, / = 0° at 
right, and I — 90° at bottom. The labels indicate the characteristic redshift of each slice. Bottom: The QSO 
density in redshift slice z03 with no KDE QSO density cut. Notice the large-scales fluctuations that would 
ruin scale-dependent bias measurements. 
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Fig. 1. — The redshift distributions for the QSO 
photometric redshift shces zOl (solid), z02 (dot- 
ted) and z03 (dashed). 



the /nl0^ term in the gravitational potential (see 
Eq. [1]) causes small-scale matter fluctuations to 
correlate with large-scale halo fluctuations due 
to mixing of their respective gravity perturba- 
tions. Positive (negative) /nl incurs a positive 
(negative) correlation between scales and an in- 
crease (decrease) in the halo bias on large scales as 
compared to the Gaussian case. Specifically, this 
scale-dependent shift in the bias was derived for a 



genera l halo mass function n{M) in ISlosar et al. 
(|2008f ) to be 



A6(M, k) 



3 In n 



NL- 



c^k^T{k)D{z)-'"^ din as 



(2) 



where k is the wavenumber corresponding to the 
length scale probed, T{k) is the transfer function, 
D{z) is the the growth factor normalized such that 
D{z = 0) = 1. c is the speed of light, Hq is 
the Hubble parameter today, and cts is the rms 



Table 2: Estimates for QSO redshift distribution 
amplitude Ai for each redshift slice with la errors. 



Label 


A, 


zOl 


1.77±0.165 


z02 


1.88±0.145 


z03 


2.13±0.869 



overdensity in a sphere of radius R = 8h~^ Mpc. 
For the case of a uni versal mass functio n, such as 
the Press-Sche chter (Press fc Schechtcr 19741 ) or 
Sheth-Tormen (jSheth fc Tormenlll99S I mass func- 
tions, this expression was shown in ISlosar et al. 



(|2008l) to reduce to 



Ab{M,k)^3h^{b-p)5, 



k^T{k)D{z) 



the expression first derived in iDalal et all (|2007t ). 
where flm is the matter density today relative to 
the critical density for a flat universe, and Sc is 
the critical density of spherical collapse. The pa- 
rameter p ranges from 1 for LRGs, which popu- 
late all halos equally, to 1.6 for quasars that pop- 
ulate only recently merged halos. We use Eq. 
with p = 1.6 in our analysis to model the scale- 
dependent bias of quasars. 

3.2. Angular pov^fer spectrum due to scale- 
dependent bias and non-Gaussianity 

The clustering bias is a key component of the 
matter power spectrum A(fc), thus we expect 
scale-dependent bias to affect it as well as the an- 
gular power spectrum Ci. Since we are interested 
in large correlations in this analysis, we cannot use 
the small-scale Limber approximation, but must 
use the full expression for Ct including rcdshift- 
space distortions. These expressions are given for 
th e scale-independent b i as cas e in Eqs. 29 and 30 



in Padmanabhan et al. (l20n7^ and for the scale- 
dependent bias case in ISlosar et ali ([20081). We 



assume the linear matter power spectrum in our 
analysi^. The Cgs were calculated for all 3 red- 
shift slices and presented in Fig. [3j including for 
the cases of nonzero /nl- 

4. Methodology 

Non-Gaussianity, which produces a shift in the 
bias of the form Ab{k) cx /nl/^^, manifests itself 
on large angular scales. If any systematic effect 
can increase power on large scales, we will be un- 
able to distinguish it from scale-dependent bias. 
One way to attempt to assess the contamination 



ISlosar et al. I ||2008| ') confirmed that nonlinearitics arc neg- 
ligible for k < O.lh Mpc~^. For our redshifts, this corre- 
sponds to ^ < 270. We only use Ci for £ < 250, so it is safe 
to use the linear matter power spectrum in our analysis. 
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50 100 150 200 250 
I 



50 100 150 200 250 
I 





50 100 150 200 250 
i 



10 20 30 40 50 
I 



Fig. 3. — The predicted QSO angular power spec- 
tra for null and nonzero /nl- The top- left and 
bottom-right panels are both for the zOl red- 
shift bin, except that the bottom-right panel's £- 
range ends at £ = 50 to see the \ow-i behavior 
of the spectrum. The solid lines are the spectra 
for the Gaussian case, and the dotted (dashed) 
lines represent the /nl = 100 (—100) case. The 
dot-dashed line represents /nl = 10. Wc nor- 
malized the /nl =7^ cases such that the vari- 
ances of number density fluctuations within a pixel 
(oc ^g{2£ + l)Ce) for these cases are equal to the 
value for /nl = 0. 



of our quasar maps would be to cross-correlate the 
quasar maps with each of the system atics and see 
if it is nonzero, which was done in Slosar et al. 



(|2008[ ) with stars and red stars. However, this 
will be difficult to do for the other systematics we 
consider because the errors are uncertain. Thus, 
in this analysis we compare angular cross-power 
spectra before and after mode-projecting an in- 
dividual systematic to see if it decreases. More 
specifically, we first calculate the angular cross- 
power spectrum without attempting to remove 
any systematics. We then recalculate it, except 
that we mode-project one systematic. Finally, we 
compare the two to see if the spectra are signif- 
icantly different. Since there are three redshift 
bins, there will be 3 cross-spectra. 

The cross-spectra are useful because we expect 
different quasar maps to be uncorrelated if they 
are clean since objects from separate redshift bins 



should not correlate with each other H However, if 
the same systematic effect appears in two quasar 
maps, then their cross-spectra will be non-zero. 
So our goal for each cross-spectra is to identify 
those systematics for which mode-projection de- 
creases the magnitude of the cross-spectra, and 
then try mode-projecting this set of systematics 
together to see if the cross-spectra goes to zero, 
which would be the limit of the usefulness of this 
test. If it remains non-zero, then we know that 
there remains systematic effects in the maps that 
are unidentified. Of course, this will only iden- 
tify those systematics which contaminate two or 
all three quasar maps. There may be systematic 
effects that contaminate only one of the maps that 
would not appear in the cross-spectra yet still con- 
taminate any estimates of /nl using auto-spectra. 
We decide not to search for this case since the 
auto-spectra we would need for the test would also 
be used to to constrain /nl- However, this case 
should be considered in future work. 

4.1. Angular power spectrum estimators 

In order to construct our angular power spec- 
trum estimator, we begin by constructing the data 
vector 



Xl 
X2 



(4) 



where Xi and X2 length- A^pix vectors constituting 
the two maps we are cross-correlating, making x 
have a length of 2iVpix. The covariance matrix for 
X is given by 



Si+Ni 
S2 + N2 



(5) 



where Si and S2 (Ni and N2) are the signal 
(noise) covariance matrices for Map 1 and Map 2, 
respectively, and we neglect any cross-correlation 
between the maps. S is given by 



2£ + l 
in 



WeCtPi[cos{h, ■ h,)] , (6) 



^Evcn with a null contamination, there will be a finite cross- 
correlation between the maps because the redshift distribu- 
tions for the bins do overlap, but it is much smaller than the 
typical errors on the cross-correlation measurement. We 
plot the fiducial cross-power spectra along with our cross- 
spectra estimates in Figs. |4] and \5\ where we see a finite 
cross-correlation on smaller scales. 
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where Ci is the theoretical angular power spec- 
trum (wc assume /nl = for our prior), Pi{x) is 
a Legcndrc polynomial and Wi is the pixel win- 
dow function, and we assume Poisson noise for N 
given by Nij = 5ij /n in terms of the mean number 
of galaxies per pixel n. Note that we project out 
the monopole and dipole of the power spectra by 
setting the Ci prior equal to 100 for ^ = and 1 
(this is equivalent to treating them by the same 
mode projection method wc use for the systemat- 
ics templates). 

We use a quadratic est imator ( Tegmark 1996t 
Padmanabhan et al. 20031) to measure the angular 
power spectra in iVbin = 4 £-bins with ranges (2- 
6, 7-11, 12-24, 25-36). Flat bandpowcrs allow us 
to write a parameter vector p = (Ci, C2, C3, (74) 
where the band amplitudes C„ are expressed as 
Ci = X]^=i C^nVe^ where 77" is a step function 
that is 1 when £ is in bin n and otherwise. Our 
estimator for p is constructed in the form p = 
F~^q, where 



form 



Fy- = -tr[C„C-iC„C-i] and 
q^ = ix^C-iC„C-ix 



(7) 



are the Fisher matrix and quadratic estimator 
vector, respectively, and C,^ = dC/dpi. Note 
the matrix inversion and trace estimation are 
done by the iterative and stochas ti c me thods 
described in detail in iHirata et all (|2004l ) and 
Padmanabhan et all (|2007l )l'^l 



4.2. Mode projection 

We use the method of mode projection to re- 
move modes due to systematics from our angular 
power spectrum estimators, instead of subtracting 
them directly, which could lead to spurious modes 
remaining. This method projects out systematics 
by in effect giving these modes large noise val- 
ues so that the estimator is rendered insensitive 



to them. Mode projection (Tegmark et al. 1997; 


Bond et al. 1998; Halverson et al. 


,2001.1 has been 


used often for CMB data (see 


Halverson et al. 



(|200ll ) for a review) , yet this method can also clean 
LSS analyses. We begin with the assumption that 
the systematic's effect on the QSO map x is of the 



^ Such methods were introduced to cosmology by lOh et al\ 



E 

i—sys 



(8) 



where the vector is the template for systematic 
i. We then project out the systematics by adding 
to the covariancc matrix C in both the quad vector 
and Fisher matrix (see Eq. [T]) the outer product 
of each systematic template with itself, according 
to 



C = C' 



(9) 



t—sys 



which yields an unbiased estimator when — 00. 
We set Q for each template large enough such that 
the resulting Fij and qt converge. 

4.3. Templates 

In this section we enumerate the various sys- 
tematics templates for which we search in RQCat. 
The systematics we consider are extinction, stel- 
lar contamination, red stellar contamination, stel- 
lar locus color offset, stellar locus flux-error offset, 
air mass, seeing, sky brightness, modified Julian 
date, camera column, and atmospheric refraction. 
We describe the possible effects of each system- 
atic below, as well as the construction of the tem- 
plates. We list the template numerical properties, 
including the template mean, standard deviation, 
C, and 5q vs. slopes, in Table |31 For air mass, 
seeing, sky brightness, MJD, camera column, and 
atmospheric refraction, the required values were 
given per SDSS scanning field, where each field 
is a spheric al rectangle on the s ky. We used the 
MANGLE (jSwanson et aZ.ll2008f) software to con- 
vert these fields to HEALPix pixels. Also, note 
that each template had its mean subtracted. 

Extinction (ext): The Galactic extinction was 
taken from the infr ared dust maps described in 
Schlegel et al. I (|l998h and given in terms of the red- 
dening E{B — V) in magnitudes. The quasar cata- 
log can be sensitive to any errors in the normaliza- 
tion or shape of the extinction law A\/E{B — V). 
Also, the extinction correction is sensitive to the 
intra-band spectral energy distributions (SED) of 
the source under consideration; quasars often have 
extreme intraband colors and this could lead to a 
variation of the quasar density determined from 
"extinction-corrected" photometry with E{B—V). 
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Table 3: Template properties for various systematics. We list each template's mean (4"), standard deviation 
(aiji), prefactor (C) for mode-projecting each template, and slope with each of quasar distributions from the 
three redshift slices. The slope (with appropriate units) of a template versus quasar distribution i is equal 
to the cross-correlation coefficient between them times <7s/<^^, where as ^ l/^/n. Note that the mean is 
subtracted from each template before the analysis commences. 



Systematic (units) 










c 


zOl slope 




z02 slope 




z03 slope 


ebv (mag) 


0.0251 


0.0109 






10^ 


+0.415 + 0.424 




-0.249 + 0.351 




-5.84+2.98 




+ 2.96 X 10"''' 


0.658 






10^ 


+0.0149+0.0070 




+0.0121 + 0.0058 




+0. 130+0.049 




+ 3.67 X 10~^ 


1.38 






10^ 


(+2.60 ± 3.35) X 10" 


3 


(+7.29 + 2.77) X 


10 


(+3.89 + 23-58) x 10~ 


ugr (mag) 


+0.0176 


. 0354 






io9 


-0.0453 + 0.1306 




-0.276 + 0.108 




+0.438 + 0.919 


gri (mag) 


+6.59 X 10-^ 


0.0282 






10^> 


-0.0852 + 0.1639 




(-8.98 + 135.61) 


X 10-^ 


-0.322+1.154 


riz (mag) 


+ 1-84 X 10"-^ 


0.0211 






10^ 


+0.201 + 0.219 




+0.401 + 0.181 




-0.0270 + 1.5419 


uerr (mag) 


-0.0164 


. 0207 






105 


-0.480 + 0.223 




-0.0589 + 0.1847 




-2.54+1.57 


airu (mag) 


1. 15 


0.0921 






106 


-0.0583 + 0.0502 




-0.0700 + 0.0415 




-0.0838 + 0.3532 


seeu (arcsec) 


1.54 


0.175 






10^ 


-0.176 + 0.026 




-0.124+0.022 




-0.198 + 0.186 


seer (arcsec) 


1.35 


0.164 






10^ 


-0.162 + 0.028 




-0.112 + 0.023 




-0.130 + 0.198 


skyu (maggies/arcsec^) 


22.2 


0.230 






106 


C-3.25 + 20.10) X 10 


-3 


+0.0265 + 0.0166 




-0.170 + 0.141 


skyi (maggies/arcsec^) 


20.3 


0.216 






10^ 


(-9.28 + 21.40) X 10 


-3 


+0.0464+0.0177 




-0.0936 + 0.1506 


mjd (days) 


52505 


552 






102 


(+3.4 ± S.4) X 10-^ 




(+1.92 + 0.69) X 


10~^ 


(+1.52 ± 0.59) X 10"^ 




+ 1.19 X 10"^ 


2.85 X 


10~ 


3 


io'3 


-2.08 + 1.62 




+1.45 + 1.34 




+1.33 + 11.42 




-7.11 X 10^^" 


2.85 X 


10" 


3 


lO"^ 


-0.324 + 1.622 




-0.697 + 1.342 




+0.125 + 11.416 




-8.42 X lO"^"^ 


2.85 X 


10~ 


3 


10^ 


+0.977 + 1.622 




+1.72 + 1.34 




+7.87 + 11.42 




-1.46 X 10~^ 


2.85 X 


10~ 


3 


10^ 


+4.44 + 1.62 




-4.73 + 1.34 




+8.80 + 11.42 




-2.92 X 10~^ 


2.85 X 


10~ 


3 


10^ 


-3.32 + 1.62 




-1.97 + 1.34 




-7.72 + 11.42 


ref 


-0.0596 


0.439 






io6 


C-2.59 ± 10.53) X 10 


-4 


C+5.00 ± 8.71) X 


10"^ 


+0.0299 + 7.41 



Table 4: Correlation coefficients between templates. Template abbreviations given in Sec. 14.31 

















ugr 


gr 






























i 




1 


cain2 








ref 




+1 


000 


+0 


327 


+0 


118 


+0 


016 


-0 


068 


-0 


252 


-0 


140 


+0 


004 


-0 


043 


-0 


087 


-0 


025 


-0 


069 


+0 


002 


+0 


017 


-0 


004 


-0 


010 


+0 


006 


-0 


008 


-0 


092 




+0 


327 


+1 


000 


+0 


364 


+0 


254 


+0 


082 


-0 


128 


-0 


182 


-0 


036 


-0 


164 


-0 


140 


-0 


Oil 


-0 


086 


-0 


006 


-0 


013 


+0 


024 


+0 


026 


+0 


009 


-0 


014 


+0 


Oil 




+0 


118 


+0 


364 


+1 


000 


+0 


082 


+0 


065 


+0 


035 


+0 


132 


-0 


056 


-0 


177 


-0 


127 


+0 


050 


+0 


005 


+0 


030 


-0 


026 


+0 


015 


+0 


005 


+0 


001 


-0 


022 


+0 


006 


ugr 


+0 


016 


+0 


254 


+0 


082 


+1 


000 


+0 


006 


+0 


018 


+0 


020 


-0 


006 


+0 


013 


+0 


Oil 


-0 


002 


-0 


016 


-0 


039 


-0 


015 


+0 


065 


+0 


019 


+0 


021 


-0 


009 


+0 


052 


gri 


-0 


068 


+0 


082 


+0 


065 


+0 


006 


+1 


000 


-0 


086 


+0 


121 


-0 


019 


+0 


024 


+0 


024 


+0 


079 


+0 


073 


-0 


012 


-0 


010 


-0 


040 


-0 


003 


-0 


003 


+0 


004 


+0 


035 




-0 


252 


-0 


128 


+0 


035 


+0 


018 


-0 


086 


+1 


000 


+0 


133 


-0 


103 


-0 


110 


-0 


021 


+0 


137 


+0 


084 


+0 


013 


+0 


002 


+0 


067 


-0 


018 


-0 


050 


-0 


019 


+0 


039 




-0 


140 


-0 


182 


+0 


132 


+0 


020 


+0 


121 


+0 


133 


+1 


000 


-0 


037 


+0 


012 


+0 


031 


+0 


116 


+0 


102 


+0 


016 


-0 


018 


-0 


003 


+0 


016 


+0 


019 


-0 


020 


+0 


031 




+0 


004 


-0 


036 


-0 


056 


-0 


006 


-0 


019 


-0 


103 


-0 


037 


+1 


000 


+0 


135 


+0 


114 


-0 


099 


-0 


041 


-0 


006 


-0 


016 


+0 


003 


-0 


001 


-0 


016 


-0 


009 


-0 


076 




-0 


043 


-0 


164 


-0 


177 


+0 


013 


+0 


024 


-0 


110 


+0 


012 


+0 


135 


+ 1 


000 


+0 


804 


-0 


119 


-0 


040 


-0 


123 


+0 


033 


-0 


065 


-0 


012 


-0 


020 


+0 


010 


-0 


027 




-0 


087 


-0 


140 


-0 


127 


+0 


Oil 


+0 


024 


-0 


021 


+0 


031 


+0 


114 


+0 


804 


+1 


000 


-0 


127 


-0 


058 


-0 


149 


+0 


040 


-0 


022 


-0 


002 


-0 


019 


-0 


017 


-0 


013 




-0 


025 


-0 


Oil 


+0 


050 


-0 


002 


+0 


079 


+0 


137 


+0 


116 


-0 


099 


-0 


119 


-0 


127 


+1 


000 


+0 


492 


+0 


041 


+0 


008 


-0 


039 


+0 


000 


+0 


014 


-0 


026 


+0 


077 




-0 


069 


-0 


086 


+0 


005 


-0 


016 


+0 


073 


+0 


084 


+0 


102 


-0 


041 


-0 


040 


-0 


058 


+0 


492 


+1 


000 


-0 


151 


-0 


008 


+0 


010 


+0 


014 


+0 


031 


-0 


002 


+0 


069 




+0 


002 


-0 


006 


+0 


030 


-0 


039 


-0 


012 


+0 


013 


+0 


016 


-0 


006 


-0 


123 


-0 


149 


+0 


041 


-0 


151 


+1 


000 


+0 


019 


-0 


025 


-0 


041 


-0 


007 


+0 


042 


-0 


111 




+0 


017 


-0 


013 


-0 


026 


-0 


015 


-0 


010 


+0 


002 


-0 


018 


-0 


016 


+0 


033 


+0 


040 


+0 


008 


-0 


008 


+0 


019 


+1 


000 


+0 


000 


+0 


000 


+0 


000 


+0 


000 


+0 


026 




-0 


004 


+0 


024 


+0 


015 


+0 


065 


-0 


040 


+0 


067 


-0 


003 


+0 


003 


-0 


065 


-0 


022 


-0 


039 


+0 


010 


-0 


025 


+0 


000 


+1 


000 


+0 


000 


+0 


000 


+0 


000 


-0 


006 




-0 


010 


+0 


026 


+0 


005 


+0 


019 


-0 


003 


-0 


018 


+0 


016 


-0 


001 


-0 


012 


-0 


002 


+0 


000 


+0 


014 


-0 


041 


+0 


000 


+0 


000 


+1 


000 


+0 


000 


+0 


000 


+0 


015 




+0 


006 


+0 


009 


+0 


001 


+0 


021 


-0 


003 


-0 


050 


+0 


019 


-0 


016 


-0 


020 


-0 


019 


+0 


014 


+0 


031 


-0 


007 


+0 


000 


+0 


000 


+0 


000 


+1 


000 


+0 


000 


+0 


022 




-0 


008 


-0 


014 


-0 


022 


-0 


009 


+0 


004 


-0 


019 


-0 


020 


-0 


009 


+0 


010 


-0 


017 


-0 


026 


-0 


002 


+0 


042 


+0 


000 


+0 


000 


+0 


000 


+0 


000 


+1 


000 


-0 


001 


ref 


-0 


092 


+0 


Oil 


+0 


006 


+0 


052 


+0 


035 


+0 


039 


+0 


031 


-0 


076 


-0 


027 


-0 


013 


+0 


077 


+0 


069 


-0 


111 


+0 


026 


+0 


006 


+0 


015 


-0 


022 


+0 


001 


+1 


000 
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Stellar contamination (star): We use the stellar 
overdensity map mentioned in Sec. [2] as the tem- 
plate. When constructing a quasar catalog, the 
main challenge is to separate stars from quasars, 
since both appear as point sources. The stellar 
contamination rate for RQCat is ~3% at lower 
redshifts, yet more precision may be required for 
non-Gaussianity measurements. 

Red stellar contamination (rstar): The tem- 
plate is similar to stellar contamination, except the 
stellar sample is narrowed further to include only 
those that also satisfy the color cut g — r > 1.4. 
Note that this template will also project our con- 
tamination from blue stars, since the "blue star" 
map is a linear combination of "stars" and "red 
stars." 

Stellar color locus offset (ugr, gri, riz): This is 
sensitive to position-dependent offsets in the pho- 
tometric calibration of the survey. We can mea- 
sure these variations through shifts in the stellar 
locus, or scatter plots of the stars between two col- 
ors. Templates were constructed, in terms of the 
ordinate vs. abscissa of the plot, for color loci 5 — r 
vs. u—g (ugr), r—i vs. g — r (gri), and i — z vs. r — i 
[riz) in magnitudes. We start by determining the 
color locus for the sample of stars used in our stel- 
lar overdensity map, which we then fit to a line. 
We then divide the sample into 49,152 HEALPix 
-^sido — 64 pixels, with each large pixel containing 
16 A'sido = 256 small pixels we use in our anal- 
ysis so that there are sufficient stars to measure 
the color locus offset. In each large pixel, we use 
its stars to construct a color locus for that pixel. 
We then take the median of the difference between 
each star's ordinate in the large pixel's color locus 
with the linear fit of the entire sample's color lo- 
cus as a measure of the offset. We then give the 
offset for all 16 small pixels the value of the offset 
calculated from the large pixel. 

Photometric errors (uerr): Statistical photo- 
metric errors can scatter objects into or out of 
the region of color space used for quasar selec- 
tion (or the region of color space that maps to 
a particular photo- z range), and thus affect the 
density of selected quasars. Variations of these 
errors across the sky can then produce spurious 
large scale power in the quasar maps. A template 
was used only for the u band. Wc construct the 
template in a manner similar to the stellar color 
locus offset template, except that (1) the ordinate 



and abscissa for the stellar loci are the flux error 
and flux (in magnitudes), respectively, and (2) in- 
stead of a linear fit, we just fit the stellar locus to 
a horizontal line (which is the same as averaging 
the ordinate values). 

Air mass (airu): Air mass is the path length 
of atmosphere a photon travels from its source to 
the telescope, normalized to unity at zenith; it de- 
pends on both sky position and observation time. 
Increased airmass increases both the statistical er- 
rors on the magnitudes of objects, as well as pro- 
ducing color terms since the bluest wavelengths 
are selectively attenuated. The 5 filters in SDSS 
are taken in rapid succession, so taking the air 
mass value from the u band observation should be 
sufficient for template projection purposes. The 
values for the template ar e taken from the Cata- 
log Archive Server (CAS) (I York et al\\2^ . 

Seeing (seeu, seer): Seeing is defined here as 
the FWHM of the point spread function (PSF) 
in a pixel, given in arcseconds from the SDSS 
CAS. It affects the photometric noise, de-blending 
of sources, and star-galaxy (i.e. point source- 
extended source) separation. Templates were con- 
structed for the u {seeu) and r [seer) bands. 

Sky brightness (skyu, skyi): Sky brightness is 
namely the brightness of the atmosphere during 
observation. Small-scale variations in the sky 
brightness can affect object identifications, in that 
the sky brightness is subtracted from the bands be- 
fore identification is performed. The values, given 
in maggies/arcsec^, are taken from the SDSS CAS. 
Templates were constructed for the u band (skyu), 
in which sky brightness is dominated by moonlight 
if present, and the i band (skyi), in which sky 
brightness is dominated by light from the intrinsic 
atmosphere. 

Modified Julian Date (mjd): The modified Ju- 
lian date (MJD) is a linear unit of time, given 
in days, to track astronomical observations. The 
MJD values for the SDSS scanning fields were 
taken from the SDSS CAS. Mode-projecting this 
template will project out modes from any system- 
atic that varies linearly with observation time. 

Camera Column (caml...cam5): The CCD 
camera for SDSS consists of six (6) camera 
columns (camcols). The CCDs arc highly cali- 
brated; however, an unknown malfunction in a 
camcol could misidentify objects, contaminating 



10 



the quasar density field. The camcols for the 
SDSS scanning fields are given in the SDSS CAS. 
A template for each camcol is almost binary, in 
that for camcol j , the template value for a pixel is 
1 if the pixel was observed by camcol j and if it 
was not. The "almost" is because the HEALPix 
pixels cross the scanning field boundaries, causing 
cases where a pixel is observed by multiple fields. 
In these cases the template value for camcol j is 
given a fractional value denoting the percentage 
of the observed part of the pixel that was observed 
by camcol j. For example, lets say a pixel was ob- 
served by two fields with different camcols, where 
camcols 2 and 4 observed 60% and 35% of the 
pixel, respectively, and 5% of the pixel was either 
not observed at all or observed by a bad field which 
had to be culled from the data. In this case, the 
templates for camcols 2 and 4 would have values 
60/(60+35)=0.63 and 35/(60+35)=0.37, respec- 
tively. It is easy to see that the sum of all the 
camcol templates must be a vector with all ones, 
which we call jQ- Thus, the template for camcol 6 
is linearly dependent on the templates for camcols 
1-5, making it unable to remove contaminated 
modes that are independent of the other camcols. 
Therefore we only mode-project camcols 1-5. We 
also orthogonalize these templates to speed the 
convergence of the estimator. 

Atmospheric Refraction (ref): Refraction of 
photons through the atmosphere can perturb the 
position of a source, distorting the density field 
and increasing/decreasing correlations. Since re- 
fraction is chromatic, the different bands will give 
different positions to a source. The template is de- 
signed to measure the elongation of the refraction- 
induced "rainbow" in the scan direction. The tem- 
plate is given in terms of the altitude [K) and posi- 
tion angle (0) of the telescope as cot^(/i) cos(20). 
which is proportional to the quadrupole field that 
describes the perturbations due to refraction. The 
h and 4> values for each scanning field were taken 
from the SDSS Data Archive Server (DAS). 

While each of the systematic templates has a 
"primary motivation," each one may be sensitive 
to several contaminating effects. For example, the 
stellar color locus test is obviously sensitive to pho- 
tometric calibration errors, but also contains some 
sensitivity to Galactic dust and to the variation 



in this paper is not equal to the identity matrix. 



of stellar populations over the sky. The stellar 
density maps were motivated by the possibility of 
stellar contamination, however they are also cor- 
related with the dust map since both are largest 
at low Galactic latitudes. However, each of these 
effects is independent of the true quasar density 
and correlations of the quasar map with any of 
them would be problematic; note also that since 
the template projection removes any linear com- 
bination of the input templates, there is no re- 
quirement that these templates be orthogonal, or 
sensitive to only a single root cause of systematic 
error. 

We also list the Pearson correlation coefficients 
for all the templates in Table H) We see that al- 
most all the templates have low correlations with 
other templates. Of the 171 possible template 
pairings, only 7 of them have correlations (posi- 
tive or negative) greater than ±20%. The only 
two templates correlated with each other by more 
than 50% are the seeing templates in the u and r 
bands (80%). This implies that mode-projecting 
this set of the templates should remove a diverse 
set of modes. 

Note that if the range of variation in conditions 
is large (e.g. as for the airmass or sky brightness), 
the spurious contribution to the quasar density 
may be nonlinear in the template - e.g. we may 
have 

"spurious = const + a{X -X) + I3{X - Xf + 

(10) 

where X is a template (e.g. the airmass) and /? is 
a nonlinearity coefficient. Since the template pro- 
jection method is a linear method, we can only re- 
move the /3-term by adding yet another template 
proportional to X^] in general, if multiple tem- 
plates have large variations we would also have to 
include products of template maps. We tested the 
mode-projection of products of templates for the 
cross-correlation of zOl and z02 since it has the 
larger cross-correlation. 

5. Results 

The cross-correlation power spectra between 
the redshift slices before mode projection are 
shown in Fig. |4l Out of 12 cross-power measure- 
ments (4 ^-bins times 3 cross-correlations), two 
exhibit significant cross-powers. The largest one 
is between slices zOl and z02, or C/^, with a value 
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of (1.63 ± 0.18) X 10"'' (9.2cr) in the first £-bin 
(2 < £ < 7). We see there is also a sizable cross- 
correlation between slices zOl and z03, or C}^, 
appearing in the second ^-bin {7 < £ < 12) with a 
valueof (7.5±2.8)xl0~^ (2.7a). Large-scale biases 
are not expected to be this high, particularly with 
the magnitude of C^. This leads us to suspect 
that systematic errors in these slices are corrupt- 
ing these quasar maps. The redshift slices z02 and 
z03 exhibit no significant cross-correlation. 

After mode-projecting each of our candidate 
systematic templates, we find several templates 
that significantly increased the cross-correlations. 
We list these templates for both cross-correlations 
in Tables [S] and [51 The biggest contributors by 



far to seem to be seeing in the u and r 



bands. Mode-projecting each of these individually 
reduce the cross-correlation significance by more 
than 3(7. Red star contamination, photometric 
calibration errors, and air mass also seem to play 
a role. Our uncovering of red star contaminatio n 
seems to be in agreement with lSlosar et "aLl (l2008l) . 
which found both of their samples of quasars to 
be contaminated by them. Mode-projecting all 
these templates yields a cross-correlation measure- 
ment of (1.03 ± 0.19) X 10-4 (5.4cr), which is 
closest to the value from u-band seeing. It ap- 
pears that seeing is a very important systematic 
effect in our quasar maps, and it should be moni- 
tored in future LSS experiments. We tried to de- 
crease Cj^ further by mode-projecting products 
of templates. We tested several combinations, in- 
cluding (airu)'^, (ugrY , (star)'^ , (seeti)^, (seer)^, 
ugr — riz, ugr — airu, riz — aim, ugr — seeu, 
riz — seeu, airu — seeu, ugr — seer, riz — seer, and 
airu — seerjfl None of the combinations we tested 
decreased the cross-correlation significantly. It is 
important to note that although we were able to 
reduce the cross-correlation significance by 3.8cr, 
the cross-correlation is far too high. We conclude 
that more systcmatics are present that were not 
found. 

For Cj^, stellar contamination is the domi- 
nant systematic. Mode-projecting this template 
reduces the cross-correlation significance by more 
than half a cr, enough to drive the significance 



^This list is not exhaustive of all the possible combinations, 
but it is large subset of the ones we would expect to be im- 
portant, including the combinations between the two seeing 
templates. 
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Fig. 4. — The measured cross-correlation angu- 
lar power spectra between all three QSO redshift 
slices. The crosses are the measured spectra with 
la errors and the solid lines are the predicted spec- 
tra. Note that the predicted spectra are not ex- 
actly zero at smaller scales because the redshift 
distributions have small overlaps. 



Table 5: Systematic templates that signifi- 
cantly reduced upon mode projection the cross- 
correlation CP in the first H-hin (2 < £ < 7), 
along with the new cross-correlation value (with 
Icr errors) and the reduction in the signal-to-noise 
ratio (SNR). 



Systematic 




ASNR (cr) 


Red stellar contamination 


1.59 ±0.19 


-0.8 


{g - r) vs. {u - g) 


1.60 ±0.19 


-0.8 


(i — z) vs. (r — i) 


1.57±0.19 


-0.9 


Air mass {u band) 


1.66 ±0.19 


-0.5 


Seeing (u band) 


1.06 ±0.19 


-3.6 


Seeing (r band) 


1.16±0.19 


-3.1 
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below 2(7. Methods to reduce stellar contami- 
nation should also be developed for future LSS 
surveys. Seeing systematics seem to be having a 
small affect on this cross-correlation measurement 
as well. When we mode-project stellar contamina- 
tion and seeing, we find a cross-correlation value 
of (5.0 ± 2.8) X 10-5 (l.Scr). Final plots for C}^ 
and Cj^ are shown in Fig. [S] 

Note we can use the measured cross-correlations 
to estimate the level of contamination in the red- 
shift slices. For example, we can estimate the 
z01-z02 cross-correlation due to contaminants as 
£{i+l)CP/{27r) ^ {{Sn/n)^) ~ where /eo„t 

is the fraction of contaminants. This implies that 
the contamination levels before mode projection 
for C|^, Cj^, and C^^ in the relevant contam- 
inated e-hinB are (2.2±0.7)%, (3.3±2.0)%, and 
<2.5%, respectively. After mode projection, the 
contamination levels for CP and C^^ reduce to 
(1.8±0.8)% and (2.7±2.0)%, respectively From 
the theoretical cross-correlation plots in Fig. [3l 
it appears we need the contamination level to be 
~1% or less to see fluctuations from /nl ~ 100 or 
0.6% for /nl 10. 

It is important to note that we can divide 
the systematic errors in our photometric quasar 
maps into two categories - contamination and 
calibration. Stellar contamination is the princi- 
pal example of the former since we are selecting 
point sources: it results due to the imperfection 
of the photometric redshift method in identify- 
ing quasars. (In principle, crowding could lead 



2 




5 -1 r T -i 

"o : 

-2 r -i 

-31 . . . J : 

1 10 100 

i 

Fig. 5. — The measured cross-correlation angu- 
lar power spectra between zOl and z02 as well as 
zOl and z03 with systematic templates included. 
The crosses are the measured spectra with la er- 
rors and the solid lines are the predicted spectra. 
Note that the predicted spectra are not exactly 
zero at smaller scales because the redshift distri- 
butions have small overlaps. 



^For C?^ I simply use the first £ bin. 



Table 6: Systematic templates that signifi- 
cantly reduced upon mode projection the cross- 
correlation in the second i-h'm {7 < £ < 12), 
along with the new cross-correlation value (with 
la errors) and the reduction in the signal-to-noise 
ratio (SNR). 



Systematic 


10^^12 


ASNR (a) 


Stellar contamination 


5.5 ±2.8 


-0.7 


Seeing [u band) 


6.6 ±2.8 


-0.3 


Seeing (r band) 


6.7 ±2.8 


-0.3 



to "negative" contamination since the presence of 
a higher stellar densi ty may reduce the com plete- 
ness of the catalog.) iRichards et al. ( 20091 ) claim 
a 96% efficiency in identifying UVX quasars with 
z < 2.2, yet this probably decreases for higher red- 
shift quasart0. This may explain why stellar con- 
tamination arises in the cross-correlation of zOl- 
z03 and not z01-z02. 

The other type of systematic error is calibra- 
tion, where variations in the observing conditions 
or instrument properties propagate to changes in 
the completeness of the quasar catalog. This may 



'''The efficiency decreases to 46% for all higher redshift 
quasars, but it is probably mu ch higher than that for UVX 
quasars. [Richards et al\ l|2009l 'l docs not state the efficiency 
in this case. 
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consist of first-order calibration effects (as in the 
case of photometric cahbration errors, or airmass 
effects that shift the center of the u band) , as well 
as second-order or noise-induced effects such as 
that contributed by the sky brightness. Although 
contamination has been the main focus of pre- 
vious photometric catalogs, it appears that even 
the excellent internal calibration of the SDSS does 
not provide samples with the uniformity on large 
scales needed for /nl studies. 

6. Conclusions 

We searched for various systematic effects 
i n the SDSS pR6 p hotometric quasar catalog 
( Richards et aZ.1 120091 ) . which is the latest catalog 
available. We find that the clustering signal from 
this catalog is contaminated on the largest scales. 
In particular, our zOl subsample (0.9 < z < 1.3) 
and our z02 subsample (1.6 < z < 2.0) exhibits 
a 9.2cr detection of cross correlation on scales 
2 < € < 7, even though quasars from sepa- 
rate redshifts on these scales should be uncor- 
related. We suspect that the cause of this large 
cross-correlation is systematic effects. By mode- 
projecting several candidate templates individu- 
ally, we find that the maps are indeed contami- 
nated by stars, photometric calibration errors, and 
variations in air mass and seeing, with seeing being 
the dominant contaminant. By mode-projecting 
this group of templates together, we were able to 
decrease the significance of the cross-correlation 
by more than 3cr. However, it is still too high to 
provide a competitive constraint on /nl through 
scale-dependent bias. We find spurious number 
density fluctuations of 2% rms, whereas we 
need a contamination level less than 1% (0.6%) 
in order to measure values of /nl less than 100 
(10). We recommend new efforts into improving 
photometry in order to use photometric quasars 
to constrain nongaussianity on large scales. 
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A. Quasar redshift distributions 

In lHo et~al\ ( 2008h . pho tometric redshift dist ributions fi{z) for quasars were constructed using spectro- 
scopic data from 2SLAQ ( Richards et a/.l [20051 ). These redshift distributions determine how the matter 
overdensity 5{x) relates to the quasar overdensity Sq{n) 



f{z)5[n,xiz)]dz . 



(Al) 



We refer the reader to lHo et al\ (|2008[) for the theory behind this method, and we describe our method which 
is similar to and follows from Ho et al. ( 20081 ). 
The expression for /(z) is given as 



Mz) = biz)Uiz) 



W{z,z')[a{z') - l]n,(z')dz', 



(A2) 



where b{z) is the linear bias as a function of redshift, xi^) — Jq c/H{z')dz' is the comoving radial distance, 
and Ili{z) is the probability distribution for the quasar redshifts. The second term in Eq. IA2I is due to 
magnification bias, which becomes important for large redshifts, with the lensing window function W{z, z') 
given for a flat universe by 



1 



X'iz) 



1 



1 



X{z) x{z') 



(A3) 



and a(z) being the logarithmic slope of the number counts of quasars as a function of flux: iV(> F) cc F 
In this appendix, we describe the method of calculating the various factors in Eq. IA2l that comprise f{z). 

We rely on spectroscopic data to compose redshift distributions. Specifically, we use spectroscopic quasars 
(spectro-QSOs) from an area with high spectroscopic coverage to construct a preliminary probability distri- 
bution Hi. prelim and a{z). We use spectroscopic data from the 2SLAQ survey, which contains 8389 spectro- 
QSOs over its total region of view. We restrict ourselves to using spectroscopic data from five rectangles on 
the sky with declination range -01°00'36" - 00°35'24" and right as cension ranges 137° - 143°, 150° - 168°, 
185° 193°, 197° - 214°, and 218° - 230°, the same as those used in lHo et all (|2008f ). for the quasar analysis. 
These rectangles in particular have high spectroscopic coverage and contain 5383 quasars. Since we also 
need photo-zs for the quasars to construct probability densities, we only use quasars that have matches in 
the RQCat, decreasing the number of objects to 3443. 

We calculate n,;.prciim(z) for each photometric redshift slice using a kernel density estimator of the form 



n 



i.prcl 



1 



1 



k=l 



(A4) 



where Nq is the number of spectro-QSOs (matched with a photo-QSO in RQCat) in the photometric redshift 
slice, Zk is the spectro-z of the fcth-matchcd quasar, and a is the slice's kernel width, a is chosen to be smaller 
than any real features in Ili.proiim yet large enough to smooth out shot noise. Table [7] lists Nq and a for each 
redshift slice, and Fig. [5] shows a plot of Ili^proiim for each slice. We also calculate a, the logarithmic slope 
of the number counts of quasars in terms of flux, by creating a histogram of number counts in terms of the 
PSF magnitude in the ly-band around g = 21 and calculating the actual slope around this value. This value 
for each redshift slice is also listed in Table [71 

The expression for fi{z) in Eq. I A2 1 requires the true probability distribution Iii{z). However, since a — 1 
is small, the second term is subdominant to the first and we can substitute Hi(z) with Hi^proiim(z) in the 
second term, giving us 



/,(z) 6(z)H,(z) 



W^(z,z')[a(^')-l]n,;,preum(z')dz'. 



(A5) 
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Table 7: Properties of Hi proiim for the 3 QSO redshift slices; Nq is the number of spectro-QSOs (matched 
with a photo-QSO in RQCat) in the photometric redshift slice, a is the slice's kernel width, and a is the 
logarithmic slope of number counts as a function of flux. 



Label Nq a a 

zOl 599 0.08 0.48 

z02 779 0.10 0.63 

z03 236 0.06 0.64 




Fig. 6. — The preliminary redshift distributions for the QSO photometric redshift slices zOl (solid), z02 
(dotted) and z03 (dashed). 
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Ili{z) cannot be similarly substituted for in the first term, so we must estimate b{z)Ili{z) using LSS data. 
We estimate b(z)D{z) as nearly constant, writing 



b{z)U,{z)D{z) = A,m, 

prelim ; 

(A6) 

where Ai is a constant. Generally, Ai would be a piecewise function of z with several breaks in order 
to estimate Ai precisely; however, we were able to estimate Ai as a constant function with very small 
uncertainties. 

We estimate Ai in each redshift slice by constraining its effect on the quasar clustering. We begin 
by estimating the q uasar correlation function Wi(0) in each redshift slice using the method presented 
Landv fc Szalavl (1993 ). Specifically, we use the estimator w^^O) along with its variance given in 



iLandv fc Szalavl ( 1993 ). given by 



DDiO)-2DRiOHRRiO) 
^ ' RR{9) ' ^ ^ 

where DD{d), DR{d), and RR{9) are properly normalized histograms of the number of data-data pairs, 
data-random pairs, and random-random pairs, respectively, binned in terms of angular separation. We use 
the following expression for the Poisson uncertainty in our estimator 

''^^[^4] = —, ^ + ^TTTTp: , (A8) 

nrljir — 1)RR n[n — 1)UU 

where n [ur) is the number of data (random) points used to calculate DD{9) {RR{9)). Note that for the 
stochastic calculations we use 25 simulations with the number of random points equal to twice the number of 
data points. We calculate Wi{9) in 10 logarithmic bins of equal (logarithmic) size in the range 0.3° < 9 < 6°. 
We avoid estimating Wi{9) at smaller angles due to nonlinearities and at larger angles due to possible non- 
gaussianity effects on large scales. We compare this estimate of Wi{9) to the expression 

/"OO 

w{9) = / dkkP{k)F{k,9) 
Jo 

F{k,9) = kY,[^^]Piicos9)[Wi{k)]\ (A9) 



1=1 



where we substitute Eq. lA6l for the first term in Eg. lASI to calculate Wi{k) in terms of A. We include rcdshift- 
space distortions in Wi{k). For calculating F{k,9) we use the Limber approximation for k > 0.0155 Mpc~^ 
by converting the sum to an integral, replacing Pi{cos9) — ^ Jo{l9), and replacing Wi{k) ^J-k /2lf{l/k) /k. 
We can write Wi{9) as a sum of terms linear and quadratic in Ai, which allows us to fit for this parameter 
for each redshift slice. We also add to this model a constant term Bi due to systematics effects for which 
we also filEI. In Table [2 we list estimates for the Ai, from which we derive the redshift distributions fi{z) 
shown in Fig. [TJ 



We verified that for k > 0.017 Mpc ^ the summand vanishes for I < 40, which removes all terms over which the Limber 
approximation is not valid. 

Note that wo did not include Bi for redshift slice z03 because it caused Ai to not be constrainable. 
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